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ANALYSES OF CALIFORNIA FARM INCOME RELATIONSHIPS 


by 


2/ 


Gary fienes’’ and Irving Hoch— 


Introduction 


This study is concerned with the investigation of relationships for rural 
farm income; this includes examination of the spatial configuration of income 
and its changes over time, a comparison of farm and nonfarm income over time, 
and the relation of income distributions to explanatory variables. In the proc- 
ess of this investigation, a number of methodological tools have been forged; 
it is hoped that these will have further application. 

There are two major phases in this study. They can be summarized as 
follows: (1) investigation of regional patterns of farm income on the basis. 
of county averages and (2) development of a multivariate income distribution 
determination model, using county income distributions. 

In the first phase, average incomes by county were calculated from data 
appearing in the 1950 and 1960 Census of Population. 

A computer program was then developed to organize and plot the county 
data on a map of California. The program organized the average income data 
into high, medium, and low categories on the basis of well-defined criteria. 


1/ Formerly Assistant Specialist in the Agricultural Experiment Station 
and on the Giannini Foundation, University of California, Berkeley; presently 
Agricultural Economist with the Pacific Southwest Forest and Range Experiment 
Station, Berkeley. 


2/ Formerly Associate Professor of Agricultural Economics and Associate 
Agricultural Economist in the Experiment Station and on the Giannini Founda- 
tion, University of California, Berkeley; presently with Resources for the 
Future, Inc., Washington, D. C. 





These organized data were then presented in the form of maps exhibiting each 
category in turn. (The program can be extended to handle a greater number 
of categories or other spatial entities.) The first phase of this study also 
included some investigation of the relation of rural farm to nonfarm income 
over time and some examination of growth in income over time. No marked dif- 
ference in income between farm and nonfarm sectors emerged at the county 
level; but marked regional differences exist and persist through time, though 
some reduction in these income differentials appears to be occurring. Hence, 
the problem of income disparity seems more a regional than a sectorial prob- 
lem in the context of California. 

In the second phase of this study, a multivariate analysis was carried 
out for rural farm income distributions in an investigation of underlying 
factors affecting income. 

Data on county income distributions for the year 1949 appear in the 1950 
Census of Population and for the year 1959, in the 1960 Census of Population. 
For simplicity, these sets of data will be referred to hereafter as the 1950 
and 1960 data, respectively. For 1950 there were 14 income classes, and for 
1960 there were 11 income classes. For each income class then, there were 
observations on the fraction of income units falling in that class by county. 
Treating these county observations as observations on a specific dependent 
variable, 14 equations were developed for the 1950 data, using 14 different 
dependent variables and the same set of (1950) explanatory variables. In 
similar fashion, 11 equations were developed for 1960. 

Because there was the likelihood of correlation among the disturbance 
terms of each set of equations, multivariate analysis was employed. 

It may be noted that estimates of coefficients turn out to be the same 
as those obtained in univariate least-squares regression. However, the con- 


cern is with the equation system rather than with individual equations. In 





particular, there is focus on the behavior of a variable's coefficients over 
the set of equations rather than on considering individual parameters an 
equation at a time. Statistical tests employed reflect this focus. 

Results were generally encouraging, with "important" explanatory vari- 
ables behaving as expected. Thus, increasing the level of education increases 
the number of persons in the higher income classes and reduces the number in 
the lower income classes. Increased urbanization shifts income distributions 
in the same fashion, while increasing the distance to the nearest trading 
center has the reverse effect. Other variables included in the analysis were 
measures of real estate value per farm, specialization, irrigation, road 
quality, tenure, age, and average farm size. 

Working with components of the income distribution rather than with the 
average based on those components is intuitively satisfying. Hopefully, results 
for the distribution should yield more accurate forecasts and may also afford 
some basis for policy decisions. 

In the following sections of the report, each phase summarized here will 


be described in detail. 


Investigation of Regional Patterns of Farm Income 


This section reports the first phase of the present study, which was the 
investigation of regional patterns of farm income. This section considers 
(1) the data employed, (2) average income estimates obtained, (3) a technique 
for spatial analysis mapping, and (4) patterns of income and of income growth 


and comparisons of farm and nonfarm income. 


Income Distribution Data 
Selected data on the size distribution of income by California counties 


for certain sectors for 1949 and 1959 are available in the 1950 and 1960 U. S. 








1/ 


Census of Population, respectively.— The data appearing here, by counties, 


were the basic source for this study =" Thus, the regional unit was the county. 
As noted above, the reference dates for the data throughout this study will be 
1950 and 1960 for simplicity of exposition. 

The unit of observation is either "family" (f) or "family and unrelated 
individuals" (fu); this latter category is also referred to as "family and un- 
related." The sectors are those defined by the Census by residence. Parti- 
cular sectors will be indicated by the following abbreviations: rural farm 
(RF); nonfarm (N), consisting of rural nonfarm and urban combined; and All 
sectors (A), consisting of rural farm and nonfarm combined. 

County income distribution data cover only part of the sectors of interest 
in both the 1950 and 1960 cases. For 1950, county data were available on family 
and unrelated units for the All and the nonfarm categories and on family units 
for the All category only. For 1960, county data were available only on family 
units; further, data on nonfarm units were not included. Some of the 1950 and 
1960 data gaps were filled by direct calculation and some by estimation. The 
array of data which were initially available plus that developed by direct cal- 
culation and estimation are summarized. 

Direct calculation was used to obtain (1) 1950 family and unrelated rural 
farm data and (2) 1960 family nonfarm data, which were calculated by subtraction 


and addition, respectively, of other categories of information. The 1950 family 


1/ U. S. Bureau of the Census, U. S. Census of Population: 1950. Vol. II, 
Characteristics of the Population, Part 5, California, 1952, 5-438p. 


Idem, U. S. Census of Population: 1960. Vol. I, Characteristics of 
the Population, Part 6, California, 1963, 6-990p. 





2/ The 1960 Census data differ from the 1950 data in several characteristics. 
The 1950 Census income data are generally based upon a 20-percent sample while 
the 1960 data are based upon a 25-percent sample. For both years, Census income 
data include only dollars of income received and do not record imputed income, 
for example, rental value of owned housing. 


ae tal 





Categories of Available and Developed Data on County Income Distributions 
1950 and 1960 


Family and unrelated (fu) (f) 


All = URB + RNF + RF : All 


nonfarm = URB + RNF : nonfarm (estimated) 


rural farm (direct : rural farm 
calculation) (estimated) 


All 


Data not developed for : nonfarm (direct 
this category calculation) 


rural farm 





* Indicates data were developed by direct calculation or by estimation pro- 
cedure. Direct calculation means data were obtained by subtraction or 
addition of other categories. Items not indicated by * are available in 
the Census. 


(Meaning of symbols not defined above: URB = urban; RNF = rural nonfarm.) 


nonfarm and the family rural farm distributions were estimated in more involved 
fashion. It was assumed that the proportionate relation between corresponding 
elements of the family and family and unrelated individuals for the 1950 All 
sector also applied to the 1950 nonfarm and farm sectors. Thus, for the farm 


sector it was assumed that 


for each county and income class. Table 1 exhibits the range and number of 
income class intervals for county income distributions by sector for 1950 and 
1960. 

For the 1950 data, all income intervals contain income recipients for 


the rural sector, with the exception of San Francisco. For 1960, the Census 





TABLE 1 


Range and Number of Class Intervals for County Income Distributions 
by Sector, 1950 and 1960 







1950 The All family Other family 
Every sectora sector sectorsb/ 


thousand dollars 






Income class 
intervals 





a/ Includes A(f), A(fu), N(f), N(fu), and RF(fu). 


b/ Includes N(f) and RF(f). 


A = All (f) = family 
N = nonfarm (fu) = family and unrelated 
RF = rural farm 








published data only for counties with a rural farm population of 400 or we 
Consequently, the following six counties had no observations in the rural farm 
sector: Alpine, Inyo, Mono, San Francisco, Sierra, and Trinity. 

A number of problems arise in working with Census income data. These 
include nonresponse (applicable in both phases of the present study) and the 
estimation of average income given skewed income distributions (applicable in 
the first phase of this study). Techniques developed to handle these problems 
are discussed in Appendix A of this report. In addition, Appendix A considers 
(1) the problem of differences in U. S. Office of Business Economics and Census 
measures of income and (2) the problem of comparability of income measures 
based on family income versus measures based on family and unrelated income. 
The discussion of these problems can be of relevance in applications of results 
presented here. 

Average Income Estimates.--Using the estimating procedures for the Census 
income distribution data described in Appendix A, average income estimates by 
county were developed and are presented for 1950 in Table 2 and for 1960 in 
Table 3. (The county ordering in these tables is on the basis of location, 
running from north to south. 

Table 2 lists family and unrelated average income for the All, nonfarm, 
and rural farm sectors; and it lists family average income for the All and rural 
farm sectors. In Table 3, estimated averages for 1960 All family and rural farm 
family are presented. 


1/ U. S. Bureau of the Census, U. S. Census of Population: 1960... ., 
p. 6-458. 


2/ For a map of California counties coded in this order and an alphabetical 
list of counties and corresponding code numbers, see Figure B-1l, infra, p. 91. 


TABLE 2 


Estimates of Average Income by Area, Unit, and Sector, 1950 


Family and unrelated 
Area 


dollars 


California ; ‘ 3,966.000 | 3,576.000 | 3,576.000 | 3,569.000 


County 


Modoc 
Siskiyou 
Del Norte 
Humboldt 
Trinity 
Shasta 
Lassen 
Plumas 
Tehama 
Mendocino 
Glenn 
Butte 
Sierra 
Nevada 
Yuba 
Sutter 
Colusa 
Lake 
Sonoma 
Napa 

Yolo 
Placer 

El Dorado 
Sacramento 
Solano 
Marin 
Contra Costa 
San Joaquin 
Amador 
Calaveras 
Alpine 
Mono 
Tuolumne 
Stanislaus 
Alameda 
San Francisco 
San Mateo 





(Continued on next page.) 





TABLE 2--continued. 










Family and unrelated 








Source: 





Santa Cruz 3,663.374 | 355772560 2;,985..336 
Santa Clara 4,389.024 | 4,280.122 3,463.137 
Merced 3,026.666 | 4,012,537 3,456.058 
Mariposa 3,298.972 | 35359.639 2,642.868 
Madera 3,195.820 | 3,430,963 3, 167.574 
Inyo 4.114.026 | 4,503.324 3,601.101 
Fresno 3,835.608 3,681.667 3,249.052 
San Benito 3.757.143 | 3.928.387 3,284.837 
Monterey 4,281.370 | 4,423.380 327 1.210 
Kings 3,861.173 | 3,889.844 3,030.97 1 
Tulare 3,404.748 | 3,814.849 3,468.637 
San Bernardino 3,439 .599 3,444 .219 2,961.119 
Kern 4,159.347 | 4,084.936 3,516.,556 
San Luis Obispo 3,608,211. | 4,595. 268 3,728.429 
Santa Barbara 4,362.000 | 4,704.896 3,746.191 
Ventura 4,249.011 | 4,975.050 4,362.227 
Los Angeles 4.352.107 | 4,591,338 3,749.414 
Orange 3,857.954 | 4,915.320 4.271.259 
Riverside 3,303.894 | 3,502.511 2,981.172 
Imperial 3,642.966 | 3,046.302 25238.778 
San Diego 4,050.935 | 4,590.450 3,478.041 


a/ Blanks indicate not applicable. 


























































U. S. Bureau of the Census, U. S. Census of Population: 1950. Vol. II, 


Characteristics of the Population, Part 5, California, 1952, p. 5-71. 








TABLE 3 


Estimates of Average Family Income by Area 
for All and Rural Farm Sectors, 1960 


| Rural farm | 


dollars 











County 


Modoc 6,777.64 8,000.83 
Siskiyou 6,391.00 65,859...72 
Del Norte 75316), 5 8,975.51 
Humboldt 7,407.97 8,109.41 
Trinity 6,839.52 a/ 
Shasta 6,850.89 6, 964.57 
Lassen 65524.63 6,882.35 
Plumas 6,583.66 6,693.19 
Tehama 6,194.71 6,313.83 
Mendocino 6,704.68 65427 463 
Glenn 6,252.48 6,485.28 
Butte 6,208.15 7,106.60 
Sierra 6, 161.18 

Nevada 6,253.78 7 h79'.57 
Yuba 5,960.62 6,776.24 
Sutter 6,789.29 7,540.84 
Colusa 6,939.46 8,134.04 
Lake 5,648.90 7,017.75 
Sonoma 6,626.77 6,808.85 
Napa 7,294.79 8,198.62 
Yolo 7,104.44 8,018.47 
Placer 65:829).91 6,531.62 
El Dorado 7,639.30 7,367.28 
Sacramento 8,204.40 8,040.16 
Solano 6,838.53 15072047 
Marin 9,958.70 75912 «54 
Contra Costa 8,586.03 8,264.94 
San Joaquin 6,903.36 1383.04 
Amador 6,216.77 7,949.03 
Calaveras 6,211.82 4,768.74 
Alpine 55 143.75 

Mono 7,936.66 

Tuolumne 6,289.88 6,106.78 
Stanislaus 6,192.09 6,152.85 
Alameda 7,941.92 7,665.73 
San Francisco 8,163.15 b/ 





California 





8,000.00 


(Continued on next page.) 
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7,295.00 







































TABLE 3--continued. 











San Mateo 
Santa Cruz 
Santa Clara 
Merced 

Mariposa 
Madera 

Inyo 

Fresno 

San Benito 
Monterey 

Kings 

Tulare 

San Bernardino 
Kern 

San Luis Obispo 
Santa Barbara 
Ventura 

Los Angeles 
Orange 
Riverside 
Imperial 
San Diego 

























a/ Blanks indicate no data available. 


b/ Not applicable. 


Source: U. S. Bureau of the Census, U. S. Census of Population: 


1960. Vol. I, Characteristics of the Population, Part 6, 
California, 1963, p 6-250. 


ee 





The mean values presented in Tables 2 and 3 are’always above the corre- 
sponding median values presented in the Census, reflecting the skewness of in- 
come distributions. For the All and nonfarm sectors, the ratio of median to 
mean ranges from about .70 to .90. 

For the rural farm sector, however, the ratio ranges Fri .66 to .86, 


which can be interpreted as indicating a greater degree of skewness in the 


farm than in the nonfarm income distribution. 


A Technique for One Variable Spatial Analysis Mapping 


Given average income by county for a particular sector, a computer pro- 
gram was designed to plot this information on a map of California. The com- 
puter program is of general applicability for spatial analysis mapping, though 
the version employed here applies specifically to counties of the state of 
California. 

In the program, only one variable is analyzed at a time, and discrete 
classes are formed for otherwise continuous variables. Spatial analysis maps 
of California are then produced, with a series of maps being produced for each 
variable of interest. For each variable, a set of maps shows counties which have 
a high, medium, or low value for that variable. The following criterion was 
employed in forming these discrete classes. If an observation on a given county 
is above the mean plus 45 percent of the standard deviation for the sample of 
all counties, then the given county is assigned to the high classification. Simi- 
larly, if the observation is below the mean minus 45 percent of the standard de- 
viation, then the county is assigned to the low classification. If the obser- 
vation is equal to or between these limits, the county is assigned to the medium 
classification. The limits selected (by the way of plus or minus 45 percent of 
the standard deviation) are appropriate in a situation where it is desired to 


divide a normal distribution into three approximately equal classes. The limits 


2— 





of the mean plus 45 percent of the standard deviation and the mean minus 45 per- 
cent of the standard deviation cut off areas of 32.635 percent of the normal dis- 
tribution. The middle class then contains approximately 34.73 percent of the 
area under the normal curve. 

The program developed is a general-purpose program and can be used for 
any variable of interest. In addition, the program is not limited to a three- 
category classification; as many as nine categories are possible with the present 
program by changing only one subroutine, and it would be a simple matter to ex- 
tend the program to handle larger numbers of categories. Again, the mapping can 
be developed for any state, region, or spatial Sie 

Spatial analysis mappings for 1960 average incomes appear as Figures 1 and 
2, with Figure 1 presenting maps for nonfarm income and Figure 2 presenting maps 
for farm income. The high, medium, and low category counties of the mappings are 
presented in Tables 4 and 5. 

Table 4 lists a cross-classification of counties in terms of nonfarm average 
income for 1950 and 1960. In the resultant matrix, a county on the main diagonal 
has remained in the same relative position between periods. This occurs for 11 
out of 18 low counties in 1950, 15 out of 25 medium counties in 1950, and 9 out 
of 15 high counties in 1950. Thus, 35 out of 58 counties are on the main diago- 
nal; that is, they do not change position. Further, only two counties move from 
low in 1950 to high in 1960, and only three move from high in 1950 to low in 1960. 
Examining the extremes, it may be noted that the high-income counties are urban- 
ized counties; the low-income counties appear in the central part of the state 
and generally are clustered around the outer portions of the Central Valley. 


1/ For additional references on spatial analysis mapping, see Elliot L. 


Amidon, A Computer-Oriented System for Assembling and Displaying Land Manage- 


ment Information, U. S. Forest Service Research Paper PSW-17 (Berkeley, Cali- 
fornia: Pacific Southwest Forest and Range Experiment Station, 1964), 34p. 
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FIGURE 1. Mapping of High, Medium and Low Average Income by County for the Nonfarm Population 
California, 1960 (Mapping by computer program ) 
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URE 2. Mapping of High, Medium, and Low Average Income by County for the Rural Farm Population 
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Cross-Classification of Rankings of Counties for Average Nonfarm Family Income, 1950 and 1960 


Rankings of 
counties 
in 1960a/ 


Mariposa 
Madera 
Low Tulare 
Santa Cruz 
San Benito 
Del Norte 
Trinity 
Calaveras 
Medium San Bernardino 
Riverside 


El Dorado 
High Mono 


TABLE 4 


Rankings of count 
Medium 


Tehama 
Glenn 
Yuba 
Stanislaus 
Merced 
Tuolumne 


ies in 19508/ 


Siskiyou 
Kings 
Inyo 


Modoc Sonoma Humboldt 
Shasta Solano Lassen 
Plumas San Joaquin Kern 
Mendocino Fresno 


Colusa Mont 


erey 


Sutter San Luis Obispo 


Placer Impe 
Yolo 


Napa 

Santa Barbara 
Orange 

San Diego 


a/ The number of counties in each category is as follows: 


1960 


Low 


Medium 


High 


1950 
Low Medium 
Lk. 6 
5 15 
2, 4 


rial 


Sacramento 
Marin 

Contra Costa 
Alameda 
Santa Clara 





Sources: Developed from data in U. S. Bureau of the Census, U. S. Censu : 


Characteristics of the Population, Part 5, California, 1952, 
Population: 1960. Vol. I, Characteristics of the Population, Part 6, California, 1963, 


p. 5-71; and idem, U. S. Ce 











San Francisco 
San Mateo 
Ventura 

Los Angeles 


<_ Vol. IT. 
nsus of 
p. 6-250. 











TABLE 5 


Cross-Classification of Rankings of Counties for Average Rural Farm Family Income, 1950 and 1960 


Rankings of 
counties 
in 1960a/ 


Low 


Mendocino 
Yuba 
Placer 
Calaveras 
Mariposa 


Lake 

Nevada 

Solano 

San Joaquin 
Santa Cruz 
San Bernardino 


Del Norte 
Sacramento 


Trinity 
Sierra 


Riverside 
Imperial 


Tehama 
Lassen 
Sonoma 
Stanislaus 
Tuolumne 


Shasta 
Butte 
Sutter 
Marin 

El Dorado 
San Benito 


Modoc 
Yolo 
Napa 
Amador 


a/ San Francisco County is excluded, having no rural farm sector. 


as follows: 


Merced 
Madera 
Kings 


Alameda 
Santa Clara 
Fresno 
Monterey 
Tulare 

Kern 


San Mateo 
San Diego 


The number of counties in each category is 


1950 
1960 Low Medium High 
Low 5 8 3 
Medium 8 12 1 
High 2) 6 v7] 
No data 2 1 2 


b/ No 1960 data available for these counties. 


Sources: 


Developed from data in U. S. Bureau of the Census, U. S. Census of Population: 


Rankings of counties in 19508/ 


eee eee es 


High 


Siskiyou 
Plumas 
Glenn 


San Luis Obispo 


Humboldt 
Colusa 

Contra Costa 
Santa Barbara 


1950. 


Ventura 
Los Angeles 
Orange 





Vol. II, 


Characteristics of the Population, Part 5, California, 1952, p. 5-71; and idem, U. S. Census of 
Population: 1960. Vol. I, Characteristics of the Population, Part 6, California, 1963, p. 6-250. 














Table 5 lists a cross-classification of counties in terms of farm average 
income for 1950 and 1960. There is again some evidence of regional clustering, 
but farm income appears much more variable than nonfarm income. For counties 
with data available, 5 out of 15 low counties, 12 out of 26 medium counties, and 
7 out of 11 high counties for 1950 remain in the same classification in 1960. 
Thus, only 24 out of 52 counties are on the main diagonal; that is, they do not 
change position. As a fraction, this is .46 as compared to the nonfarm fraction 
of .60. 

Southern California coast counties are in the high category as are Contra 
Costa, Colusa, and Humboldt. The persistently low-income counties include 
Mendocino and four counties on the eastern edge of the Central Valley. 

The mappings tend to confirm the T. W. Schultz location hypothesis, which 
involves the argument that average farm income is a function of proximity to 


if 


metropolitan markets, that income increases as access increases.— This re- 


lationship seems particularly strong for the 1960 case. 


Intersectorial and Intertemporal Comparisons 


Given the results developed in the preceding section, a number of compari- 
sons between sectors and over time became possible and were carried out. 
Intersectorial comparisons were carried out through comparison of rural 


farm to nonfarm results for both 1950 and 1960 data. Ratios of average income 


2/ 


for farm relative to nonfarm are presented in Table 6.— (It should be noted 


1/ Theodore W. Schultz, The Economic Organization of Agriculture (New York: 
McGraw-Hill Book Company, Inc., 1953), Chaps. IX, X, XVII, and XVIII. 


2/ Computation of the necessary mean values employed here involved some ad- 
ditional data manipulation. In particular, some income classes were aggregated 
so as to make use of identical average values for the upper tail of the empiri- 
cal distributions. For this reason, the means used here are not necessarily 
identical with those presented earlier in this section. 


=18= 





TABLE 6 
Intersector Ratio Comparisons of Average Family Income: 


Rural Farm Relative to Nonfarm Sector by County 
1950 and 1960 


; Family and unrelated, 1950 Family, 1960 
Ratio of :ural farm Ratio of rural farm 
Count O nonkernt! O nonfarn2/ 


Modoc 0,96 1,13 
Siskiyou 1,13 1.00 
Del Norte 0.78 1.18 
Humboldt 0,99 1.02 
Trinity 0.65 b/ 
Shasta 0.90 0.93 
Lassen 0.96 0.95 
Plumas L.p3 0.92 
Tehama 1.08 0.94 
Mendocino 0.86 0.89 
Glenn Lo2? 1.00 
Butte 1.09 1.08 
Sierra 0.90 

Nevada 0.96 1.07 
Yuba 0.92 1,05 
Sutter 1.00 1.04 
Colusa 1.26 1. L7 
Lake 0.99 1.23 
Sonoma 1.00 0.95 
Napa 0.98 1.03 
Yolo 1.03 1.02 
Placer 0.79 0.86 
El Dorado 1/05 0.87 
Sacramento 0.78 0.87 
Solano 0.82 0.93 
Marin 0.79 0.73 
Contra Costa 1.03 0.87 
San Joaquin 0.86 1.00 
Amador LOL Leh 
Calaveras 0.96 0.67 
Alpine 3.81 

Mono 1.33 

Tuolumne 1.08 0.88 
Stanislaus 1.08 0.93 
Alameda 0.93 0.88 
San Francisco ef c/ 
San Mateo 0.67 0.94 


(Continued on next page.) 


iQ 


TABLE 6--continued. 





Family and unrelated, 1950 Famil 1960 

Count to nonfarma/ to nonfarma/ 
Santa Cruz 
Santa Clara 
Merced 
Mariposa 
Madera 
Inyo 
Fresno 
San Benito 
Monterey 
Kings 
Tulare 
San Bernardino 
Kern 
San Luis Obispo 
Santa Barbara 
Ventura 
Los Angeles 
Orange 
Riverside 
Imperial 
San Diego 


a/ Ratio of rural farm to (urban plus rural nonfarm). 
b/ Blanks indicate no data reported for the counties involved. 


c/ Not applicable. 


«20S 








that the 1950 results are based on family and unrelated income, while those 
for 1960 are based on family income. Because of differing proportions of 
family and of unrelated units in farm and nonfarm populations, these bases are 
not strictly equivalent. Use of the family and unrelated base will tend to 
raise the farm average relative to the nonfarm. This qualification should be 
kept in mind in considering the results below.) The ratio comparison for 1960 
is exhibited in the map of Figure 3. 

On the whole, California rural farm and nonfarm average incomes do not 
diverge particularly. Thus, for 1950, 31 counties have an income ratio above 
1.0 and 26 counties have an income ratio below 1.0; in 1960, the distribution 
is 24 and 28, respectively. Again, for 1950, 37 counties (out of 57) have an 
income ratio between .9 and 1.1; for 1960, 32 counties (out of 52) fall within 
these limits. There are four counties with a ratio below .9 in both years 
(Mendocino, Placer, Sacramento, and Marin) and five counties with a ratio above 
1.1 in both years (Colusa, Madera, Tulare, Ventura, and Orange). 

Growth in average income over time was measured and compared using the 
compound growth rate per year. The compound growth rate was estimated for each 


county using the relation: 


cree = (1+ Oo” Fey 
where 
% = average income 
t = 1950 and 1960 
and 
(1 + r) = yearly compounding factor. 
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FIGURE 3. Comparison of Rural Farm to Nonfarm Average Income for the 
Family Observation Unit by County, 1960 
(Mapping by computer program ) 
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Results obtained appear in Table 7 for the nonfarm and the rural farm sectors 
using family income estimates. 

It is worth noting that the data in Table 7 are not deflated for changes 
in the general price level. Such deflation can be carried out with relative 
ease if estimates of growth in real income are desired 

For rural farm income, counties with high rates of growth appear to fall 
into regional clusters. Thus, counties with growth rates above 7 percent (in 
Table 7) include clusters in the southeastern part of the state, in the northern 
San Joaquin area, and above the Bay area (Napa, Lake, and Mendocino counties). 

If rural farm growth rates are compared to nonfarm growth rates, it turns 
out that the rural farm rate is higher for 22 counties and lower for 30 counties 
(of the 52 counties for which data are available). Regional patterns seem fairly 
well defined, with farm rates above nonfarm for the southern San Joaquin Valley, 
the eastern part of the Sacramento Valley, and for some scattered counties at 
the extreme portions of the state. 

A final set of comparisons can be obtained by examining changes in average 
income rank between 1950 and 1960. (Income rates were obtained from the data 
of Tables 2 and 3. They are presented and discussed in Appendix B.) Compari- 
sons were carried out by subtracting the 1960 rank from the 1950 rank; if a 
county moved up in rank (and, hence, had a lower rank number), this difference 
is positive. Similarly, a decline in rank yields a negative difference. Results 
are plotted in Figures 4 and 5. 


1/ For example, on a 1957-1959 base, the Bureau of Labor Statistics consumer 
price index increased from 83.0 to 101.5 between 1949 and 1959. These figures 
yield a compound growth rate in the price level, p, equal to 2.03 percent per 
year. The deflated income growth rate, r*, is then found from the following 
equation: (1+ 4r) = (1+ r*) (1 +p). 


a 





TABLE 7 


Estimated Yearly Growth Rates of Average Family Income 
for Rural Farm and Nonfarm Sectors by County 
for the Period 1950-1960 


Compound growth rate, r 


Modoc 
Siskiyou 
Del Norte 
Humboldt 
Trinity 
Shasta 
Lassen 
Plumas 
Tehama 
Mendocino 
Glenn 
Butte 
Sierra 
Nevada 
Yuba 
Sutter 
Colusa 
Lake 
Sonoma 
Napa 

Yolo 
Placer 

El Dorado 
Sacramento 
Solano 
Marin 
Contra Costa 
San Joaquin 


Amador 
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(Continued on next page.) 
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TABLE 7--continued. 


esis eee 
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Calaveras 


~— 


Alpine 

Mono 

Tuolumne 
Stanislaus 
Alameda 

San Francisco 
San Mateo 
Santa Cruz 
Santa Clara 
Merced 
Mariposa 
Madera 

Inyo 

Fresno 

San Benito 
Monterey 
Kings 

Tulare 

San Bernardino 
Kern 

San Luis Obispo 
Santa Barbara 


Ventura 


Los Angeles 


Orange 
Riverside 


Imperial 


YN DWwMON DNN DD WAT UN UD AHDU DUN UH WADKDATDAHD HDD CO WN 


San Diego 





a/ Blanks indicate no data available. 
b/ Not applicable. 


Sources: Developed from data in U. S. Bureau of the Census, 


U. S. Census of Population: 1950. Vol. II, Characteristics 
of the Population, Part 5, California, 1952; and idem, U. S. 


Census of Population; 1960. Vol. I, Characteristics of the 
Population, Part 6, California, 1963. 
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Includes counties dropping 
more than 3 ranks 


Includes counties increasing 
more than 3 ranks 







These counties have rank change 
between ~3 and 3, 


FIGURE 4 Change in Rank of Counties for Nonfarm Income between 1950 and 1960 
(numbers indicate amount of change in rank; thus, a positive number 
indicates a county is higher in rank--- relatively better off ) 
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Ray Includes counties dropping 
more than 3 ranks 


Includes counties increasing 
more than 3 ranks 


Po] These counties have rank change 
between -3 and 3 





FIGURE 5 Change in Rank of Counties for Rural Farm Income between 1950 and 1960 
(numbers indicate amount of change in rank; thus, a positive number 
indicates a county is higher in rank --- relatively better off) 
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The results can be interpreted as indicating changes in relative position. 
There seem to be well-defined regional patterns in Figures 4 and 5. 

Thus, the farm and nonfarm patterns show marked similarity. Increased 
rank (higher relative position) occurs for southern California (particularly 
in proximity to Los Angeles), for some counties in the Bay area, and for coun- 
ties near Sacramento County. Decreased rank occurs in the San Joaquin Valley 
and in the northern part of the state. A fairly obvious hypothesis here is 
that increased urbanization (and suburbanization) is the major factor at work 
in these shifts. It is worth stressing that these shifts occur for the farm as 
well as nonfarm sectors. 

The work of this section concludes with a brief investigation of the 
hypotheses that (1) growth in average family income is related to base period 
(1950) income and (2) level of rural farm average income is related to non- 
farm average income. 

It was hypothesized that growth in average income is negatively related 
to base period income, reflecting equilibrating forces at work in the economy. 
The hypothesis was tested by regression of income growth from 1950 to 1960 on 
base year income (1950) by county. This regression equation was fitted for 
both the nonfarm and the rural farm sectors. Regression results obtained 
appear in Table 8. 

The results obtained were in line with expectations in that a negative 
slope was obtained in each equation. The R* values were not particularly high; 
however, t-ratios were significant at the 5-percent level for the rural farm 
sector and close to significant for the nonfarm sector. 

The hypothesis that level of rural farm average income is related to non- 
farm average income was also tested by regression analysis. It was expected 
that slopes would be positive and this in fact occurred. Increasing levels of 


nonfarm income are associated with increasing levels of farm income. 
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TABLE 8 


Regression Results Obtained in Relating County Growth Rate of Average Income 
(for the Period 1950-1960) to Base Year Average Income (1950) 


by Nonfarm and Rural Farm Sectors 
pet ea Coefficient 
Nonfarm 





Number of 
county ob- 
servationsa 











Rural farm 


a/ Six counties had no data on rural farm income in 1960. 


Table 9 lists the results obtained for rural farm income regressed on 
nonfarm income for both 1950 and 1960. It also lists results obtained when 
rural farm growth rates are regressed on nonfarm growth rates. In all cases, 


estimated coefficients are significantly different from zero at the 5-percent 


level. 


A Multivariate Income Distribution Determination Model 


The second major phase of this study is concerned with the construction 
and application of a multivariate model of factors determining the size disitri= 
bution of income. The empirical application here involves the California rural 
farm sector for 1950 and 1960 (using family income data), but it is expected 
the model will be of general interest and applicability. 

Material covered in this section is organized into the following parts: 
(1) a discussion of previous theoretical and empirical work and a presentation 
of the general approach used here, (2) a formal summary of the statistical model 
and of tests of hypotheses for that model, (3) a discussion of the specific form 


of the applied model and a statement of results obtained, and (4) interpretations 
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TABLE 9 


Regression Results Obtained in Relating Rural Farm Average Income Measure 
to Nonfarm Average Income Measure, 1950 and 1960 


Period Number of 
and income county 
measurea/ observationsb/ Constant Coefficient t-ratio 


1950 average 

income 52 . 366 
1960 average 

income 52 Pele pil 
Annual rate 

of growth 

in average 

income for 

the period 

1950-1960 52 4270: 


a/ The measure listed is used in the regression of rural farm values on nonfarm values; the 
rural farm measure is the dependent variable, and the nonfarm measure is the independent 
variable. 
























0.2445 









b/ Six counties had no data available on rural farm income in 1960; these counties were also 
excluded from the 1950 regression for consistency in comparison. 








of those results. This study then concludes with a summary of significant find- 


ings and some suggestions on the direction of future work. 


Size Distribution of Income: Previous Work and Approach Used Here 

No comprehensive theory of the size distribution of income exists at the 
present date. Even though attempts at theorizing began as early as the Physio- 
crats, no complete theory has been developed” Two approaches have emerged 
over the years. One is concerned with the functional distribution of income, 
while the other is concerned with the personal (or size) distribution of in- 
come. Functional distribution of income theories are the best known for they 
center around economic theory and the marginal pricing of the factors of pro- 
duction. However, the case is usually abstracted to a single income-earning 
unit with the objective of explaining that unit's level of income in terms of 
conventional theory. The other area of theoretical work, which has concentrated 
on explaining the shape of the personal distribution of income, is relatively 
in its infancy. 

An example of theoretical work in this area is that of Robert Summers, who 
was concerned with estimating the shape of lifetime size distributions and 
isolating some of the relevant factors in determining that diaper Thus, the 
problem was to construct a theoretical framework which could be used to estimate 
the prospective lifetime size distribution of average annual income for a "house- 


hold" which remained the same in an economy that did not change. The approach 


is temporally dynamic but assumes no structural changes in the economy. 


1/ For a historical perspective, see John Maurice Clark, "Distribution," 
Readings in the Theory of Income Distribution ("Blakiston Series of Republished 
Articles on Economics," Vol. III; Philadelphia, Pennsylvania: The Blakiston 
Company, 1949), Chap. 3, pp. 58-71. 


2/ Robert Summers, An Econometric Investigation of the Size Distribution of 
Lifetime Average Annual Income, Stanford University, Department of Economics, 


Technical Report No. 31 (Stanford, California, 1956), 127p. 
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Empirical work in the area has involved attempts at direct analyses of the 
influence of selected variables upon certain income statistics or indirect analy- 
ses by first standardizing the distribution, recomputing the statistic from the 
distribution, and then examining the effect upon the statistic. An example of 
the first approach is the work of Senn! Zeman made adjustments for certain 
influencing variables directly upon his indices of wages. An example of the 
second approach is that of Smolensky, who first standardized the income distri- 
bution and then examined the effect upon a statistic of the distribution (the 
coefficient of ganiavies) = 

The above general approaches have the disadvantage of not indicating or 
measuring the functional relation between the supposedly influencing variables 
and the income distribution. Thus, the statistical significance of the findings 
cannot be tested. 

F. Gerard Adams moved toward statistical tests of functional relations in- 
volving income distributions by constructing a model which "explains" the "amount 


3/ 


of an individual's income." This model can be used to make statistical tests 


of relations between explanatory variables and the variable amount of individual 
income. But it cannot be used to make statistical tests of the influence of 


explanatory variables upon the distribution of income. 


1/ Morton Zeman, "A Quantitative Analysis of White-Nonwhite Income Differ- 
entials in the United States" (unpublished Ph.D. dissertation, Department of 
Economics, University of Chicago, 1955). A discussion of Zeman's dissertation 
appears in Robert J. Wolfson, "An Econometric Investigation of Regional Dif- 
ferentials in American Agricultural Wages,'"' Econometrica, Vol. 26, No. 2 
(April, 1958), pp. 225-257. 


2/ Eugene Smolensky, "An Interrelationship Among Income Distributions," 
Review of Economics and Statistics, Vol. XLV, No. 2 (May, 1963), pp. 197-206. 


3/ F. Gerard Adams, "The Size of Individual Incomes--Socio-Economic Vari- 


ables and Chance Variation," Review of Economics and Statistics, Vol. XL, 
No. 4 (November, 1958), pp. 390-398. 
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Work here is concerned with estimating functional relations and in de- 
veloping statistical tests of the influence of explanatory variables on income 
distributions. It differs from previous empirical work by attempting to re- 
late observations in a given income interval to a set of explanatory variables. 
This would appear to be an innovation in income distribution studies, and some 
details of the procedure are worth noting at this point. 

Census data on family unit income distributions by county for the rural 
farm sector were employed in the first section of this study to construct county 
averages. They are employed again in this section as observations on a set of 
dependent variables. Each income class interval is associated with a correspond- 
ing dependent variable. The dependent variable consists of the proportionate 
number or frequency of income units measured in percentage terms. For each de- 
pendent variable, then, there will be as many observations as counties reported. 

Each of the dependent variables is related to a given set of independent 
variables using multivariate analysis. Observations on these variables are the 
same for all cases; that is, each dependent variable is related to the same set 
of independent variables. 

The meaning of coefficients obtained is of some interest. A negative co- 
efficient indicates that, as the independent variable increases, the number of 
units falling in the specific income interval decreases. Similarly, a positive 
coefficient indicates an increasing number of units in the interval as the in- 
dependent variable increases. The multivariate approach employed here allows an 
investigation of the behavior of a coefficient in each income class in turn. 

A good deal of additional information (relative to work with income class aver- 
ages) is thereby generated. With results obtained for different time periods, 


changes in structure can then be investigated. 
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The Statistical Model 
The statistical model and hypothesis-testing approach of this section is 
an application of the theoretical and empirical works of Anderson, Goldberger, 
Hooper and Zellner, Hotelling, Kendall, O'Regan, Schatzoff, and Wilke 2! 
Consider the following general linear model formulation for any sector. 


For G income intervals, we define the following set of G equations: 


1) = Bayo * Beaya ¥1 + Bayo % + +++ + Bay XR t 4%) 
(2) = Bcayo * Bcaya ¥1 + Beaya Xo t+ oes + Beayg Xm + YQ) 


(3) = 8e3yo * 8¢3y1 ¥1 + Beay2 Xo + +++ + Bea) %& + 43) 


(c) = Bieyo + Big) Xx) + Bigya X, t+i.et BG) K X + UG) 


1/ T. W. Anderson, An Introduction to Multivariate Statistical Analysis 
(New York: John Wiley and Sons, Inc., 1958), 374p. 


Arthur S. Goldberger, Econometric Theory (New York: John Wiley and Sons, 
Inc., 1964), 399p. 


J. W. Hooper and Arnold Zellner, "The Error of Forecast for Multivariate 
Regression Models," Econometrica, Vol. 29, No. 4 (October, 1961), pp. 544-555. 


Harold Hotelling, "A Generalized T Test and Measure of Multivariate Dis- 
persion," Proceedings of the Second Berkeley Symposium on Mathematical Statis- 
tics and Probability (Berkeley and Los Angeles: University of California Press, 
1951), pp. 23-42. 


Maurice George Kendall, A Course in Multivariate Analysis (London: Charles 
Griffin and Company, Limited, 1957), 185p. 


William Gerard O'Regan, "An Experimental Approach to the Determination of 
Demand for Orange Concentrate" (unpublished Ph.D. dissertation, Department of 
Agricultural Economics, University of California, Berkeley, 1962), 74p. 


Martin Schatzoff, "Sensitivity Comparisons Among Tests of the General 
Linear Hypothesis,'' Journal of the American Statistical Association, Vol. 61, 
Part 1 (June, 1966), pp. 415-435. 


Samuel Stanley Wilks, "Certain Generalizations in the Analysis of Variance," 
Biometrika, Vol. 24 (November, 1932), pp. 471-494. 
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where a subscript in parentheses denotes income interval, g; g=1, ..., G. 
(Parentheses enclose g values for purposes of clarification.) 


This set of G equations can be written compactly as: 
Y= X68 + U 


where Y is an N by G regressand observation matrix (N being the number of 


observations): 


4@ Y1(2) eee Yu(6) 





Y9 (1) Y9(2) wee Y9(c) 


Yn) Yy2) siete Yn) 


Thus, the first column consists of all observations in the first income 
interval. In the present case, this consists of the 52 county observations 
falling in the lowest income class. A given row, i, then contains all the in- 
come interval observations for a given county, i, with i= 1, ..., 52 (given 
observations on 52 counties). Thus, the first column in the definition of Y 
corresponds to the set of observations on 7h) in the first equation of the 
linear model, and so on through the Gth column in the definition of Y. Income 
interval subscripts are again enclosed in parentheses. 

In matrix terms, X is an N by K + 1 regressor observation matrix, with 


observations on X invariant over income class: 
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a-t abe oe en 
: k= 1, ne, 
The coefficient matrix, 8, is of size K+ 1 by G: 
| 
Paya Pago # cee 
Praja Peay *** Preys 
8 -{e } , gs 1 a 
gk ; oe Carre 


Bay BayK atin Bex 


The disturbance terms appear in an N by G residual matrix: 


“aly "Maes 


Saciy ie 


The following assumptions are made with regard to the disturbance terms 


ig’ 
E (15) = 0. 


For fixed g and i varying, disturbances are assumed distributed indepen- 


dently with equal variance Tae" Thus, 


oe 








EMie uit Bie} Mig + Stay 


“N(g) 
0 en, 
ere 10) 
ss 
= ‘ =@ 1, 
8s 
0 fo) 
es 





In similar fashion, for specific county values, g and h: 


Msg inh E “1 (g) [Uy (ny wei UN (py! = Pe 


“N(g) 
cecil 


Thus, for the same observation number, disturbances in different equations 


have covariance oO for different observation numbers, disturbances in different 


gh? 
equations are independent. 


For any observation number, say i = k, we may write: 


Bf ece) kewn t® [Heery | Mecay “++ ace! 


"k(G) | 
“a “ie 8°? ie 

= ; {oun c x exc. 
[1 vey) oAi® revel 


a 





The general multivariate case does not assume 0, = 0 for g # h; if that 


gh 
assumption were made, the case resulting is that of a set of ordinary least- 
squares regression equations. If the disturbances or are assumed normal, the 
multivariate case becomes the classical multivariate normal linear regression 
model. 


1/ 


The 8 matrix may be estimated by the maximum likelihood method.— The 


final solution of this mathematical derivation is: 


“an 


where 8 is the matrix of estimated coefficients which, of course, corresponds 
to the least-squares estimated coefficient matrix. Goldberger develops the 
proof that individually minimizing the diagonal elements of t corresponds to 
minimization of the determinant of + 2/ 


The estimated variance-covariance matrix may be defined as follows: Let 


“aw Nn 


Y = X8 and V=Y - Y, then t = V'V/degrees of freedom. 

In computing results a stepwise procedure was employed. In this procedure 
the "best'' explanatory variable is introduced at the first step, the "second 
best" explanatory variable at the second step, and so on. The criterion em- 
ployed in selection of the variable to be introduced was that of minimization 
of the determinant of the estimated variance-covariance matrix, [f]. It should 


be noted that the employment of this criterion poses a problem of singularity, 


owing to the definition of Y in terms of Prequbicies < 


2/ Idem, pp. 205 and 206. 


3/ In the Y matrix for this case, each row sums to 100 and each column sums 
to 100. This results in a (Y - Y) matrix with the characteristics that (1) each 
row sums to zero and (2) each column sums to zero. When these characteristics 
hold for the basic matrix, the resulting matrix of sums of squares and cross 
products will have the same characteristics. As a consequence, any row will 
exactly equal the negative sum of all other rows. 
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To proceed with estimation in this situation, it is only necessary to 
eliminate one column of the Y matrix. Estimation can then be carried out for 
a set of G - 1 equations. To obtain the vector of 8 coefficients omitted in 
the first phase, an alternative column is eliminated and estimates obtained 
for the alternative set of G - 1 equations. In the present study, the first 
column and then the second column were eliminated. These cases were termed 
phase 1 and phase 2, vespectively 2’ 

One of the basic assumptions of the multivariate approach is that off- 
diagonal elements in t are nonzero. (As noted above, if this is not assumed, 
a univariate appwoash to estimation and hypothesis testing would be appropriate 
for each equation.) 


This assumption can be put in the form of a null hypothesis to be tested 


i t={,4 


where Ons is a diagonal matrix. The hypothesis can be tested using a test 


as follows: 


employed by Otepen” The test is based upon a statistic calculated from the 
estimated residual correlation matrix. This statistic is distributed under 
the stated hypothesis approximately like Chi square with nae degrees of 
freedom, where G is the number of equations. 

For the general model presented in this paper, the following methodologi- 
cal approach to multivariate statistical hypothesis testing will be followed. 


1/ In the case investigated here, the G - 1 equations in either phase pro- 
duced identical statistical results (as might be expected). 


2/ O'Regan, op. cit., pp. 26 and 27. 


ae 


nw “a 


Recall the definition of {, and now define t, as the estimated variance- 
covariance matrix of residuals under restrictions imposed upon the 8 matrix 
by the current hypothesis. 


Partition the 8 matrix as follows: 


[fs 
Bo 


by G (where K, + K, = K +1). 


where By is K 1 2 


by G and B, is K 


Hl 2 


Consider the set of hypotheses: By = B14 where Boy is a selected set of 


constants. The Anderson U statistic is now defined as: 


or, more simply, the U statistic can be computed as {ep that is, a ratio of 
R 
determinants. 

Recent investigations by Schatzoff substantiate this test statistic selec- 
tion.2/ Schatzoff concludes that using the ESL (expected significance level) 
criterion to judge the we most commonly suggested multivariate test statistics, 
the two most favorable statistics are the Hotelling T* and Wilks' likelihood 
ratio statistic. This latter statistic is identical to the (unadjusted) Ander- 
son U statistic above. Schatzoff also concludes that these two statistics are 
asymptotically equivalent. 

Now partition the X matrix analogously to that of the 8 partitioning. 
and N by K 


The resulting X, and xy submatrices have the dimensions N by K 


il: 1 2? 


respectively. 


A/ Sehatzoff, op. cit. 


ins 








The definition of the numerator of the Anderson U statistic is: 
|B] = [ce - x8)* ce - xB). 
The definition of the denominator of the Anderson U statistic is: 
i.) es Bye ~ % 8° @ HB =, 8) |. 


For the specified set of hypotheses, a transformation of the U statistic 


can be employed. Probability figures for this transformed U can be calculated 


1/ 


using the Chi square distribution. The transformation formula is:— 
Z = -AM [log (U)]. 


Log is a natural logarithm, and 


AM = AN - AK, - .» (6 + AK, + 1) 
where 
AN = number of observations 
AK, = number of unrestricted 8 vectors 
AK) = number of restricted 8 vectors 
and 
G = number of equations. 


The statistical routine which calculates the probability figures for the 
stated hypothesis uses the first two terms of Anderson's suggested formulation. 


In this general framework, the formula for the 85 submatrix of coefficients is: 


1/ Anderson, op. cit., Chap. 8. 
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Application of the Model and Statistical Results Obtained 


The multivariate model described above was applied as follows: California 
Census data on rural farm family income yielded a set of dependent variables. 
Each dependent variable consisted of the percent of total income units in 
a particular income fneerval 2! 
These interval definitions resulted in the estimation of 14 equations for 
1950 and 11 equations for 1960. Thus, for the 1950 model, G is equal to 14; and 
for the 1960 model, G is equal to 11. For both models, the number of observa- 
tions, N, is equal to 52. Six counties were eliminated because of lack of com- 
parable data over the jeeade 
The explanatory variables might be characterized as of two types: (1) meas- 
ures of characteristics of farm entrepreneurs (for example, education and age) 
and (2) measures of characteristics of the spatial unit (for example, index of 


urbanization, availability of irrigation water, etc.). 


In particular, the explanatory variables included the following measures: 


1. Real estate--average value of land and buildings per farm. 
(Values are expressed per $1,000.) This may be a useful proxy 
for total capital. 

2. Education--the recorded average education of farm operators in 
the county. (Values are expressed in average years of schooling 
completed.) 

3. Farm size--average acreage per farm in the county. (Original 
values were divided by 1,000.) 

1/ For a definition of income intervals, see supra, Table l, p. 6. 


2/ The counties eliminated were: Trinity, Sierra, Alpine, Mono, San Fran- 
cisco, and Inyo. 
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4. Isard index--a measure of farm-type specialization. This is a 
modified Isard coefficient of specialization and is discussed in 
detail betow A! 

5. Urban index--proportion of the county's population classified as 
urban. 

6. Irrigated farms--proportion of all farms using irrigation water. 

7. Road Class 1--proportion of all farms on hard surface roads 

8. Tenure--proportion of total farms operated by managers. 

9. Trading center--a measure of average distance in miles to an urban 
settlement. 


10. Age level--average age of farm operators. (Original values were 


divided by 10.) 


The Census of Agriculture (1950 and 1959) was the basic source of data for 
the explanatory variables defined shove” The Isard index may be considered 
in detail at this point. As noted above, the index used here is a modified 
Isard coefficient of specialization where the Isard coefficient of specializa- 


tion is defined as: 


1/ Walter Isard in association with David F. Bramhall et al., Methods of 


Regional Analysis: An Introduction to Regional Science ("Regional Science 
Studies Series 4," Vol. II; Cambridge, Massachusetts: The M.I.T. Press, 1960), 


Chap. 7, pp. 232-308. 


2/ Other road classes defined by the Census of Agriculture are: Road 
Class 2--gravel, shell, or shale; and Road Class 3--dirt or unimproved. 


3/ U. S. Bureau of the Census, U. S. Census of Agriculture: 1950. Vol. I, 


Counties and State Economic Areas, Part 33, California, 1952, 296p.; and idem, 


U. S. Census of Agriculture: 1959. Vol. I, Counties, Part 48, California, 
1961, 287p. 
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where 
C = Isard coefficient 


M = total employment in a comparison unit (for example, the entire 
area analyzed) 


M, = total employment in industry, i, of comparison unit 
S = total employment of analyzed unit 
S, = total employment in industry, i, of analyzed unit 
i = industry group 
and 
N = number of industry groups. 


For this study, the modified Isard index of farm-type specialization is 


defined as: 


1 i 
where 
C* = modified index 
P, = proportion of county farms which are classified in farm-type i 
and 
N = number of farm-type classes. 


The farm-type classification system used here is that defined by the 
Census of Agriculture. The following eight major types of farms are defined: 
commercial, vegetable, fruit and nut, poultry, dairy, livestock, general, and 
miscellaneous. The objective was to measure the heeed of farm-type specializa- 
tion. If the county had the same number of farms in each of the eight classes, 


the index value would be zero. The other limit for the index would occur if all 
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the county's farms were of one type. In this latter case, the index value 
would be 1.75. These are the lower and upper limits of the index for eight 
farm iuea 

Coefficient estimates were then obtained. (As noted above, singularity led 
to estimation of the coefficients in two phases, with phase 1 involving the elimi- 
nation of the first column or dependent variable and phase 2 involving the eli- 
mination of the second dependent variable.) Coetficient estimates for 1950 and 
1960 appear in Tables 10 and 11, respectively. 

The sample residual correlation matrices were calculated for each appli- 
cation of the model, with relatively high values occurring for off-diagonal 
elements in all cases. As an example, Table 12 lists the residual correlation 
matrix for phase 1 for 1950. 

The hypothesis that the off-diagonal elements equaled zero was tested by 
the appropriate Chi square test (as described above). Results appear as 
Table 13. 

In each of the cases, the probability that a table Chi square with ap- 
propriate degrees of freedom is greater than or equal to the calculated Chi 
square value is approximately equal to zero. Therefore, each of the hypotheses 
is rejected, and the multivariate approach is justified for each model. 

One method of evaluating the importance of a particular variable in a 
multivariate framework is to compute the percentage change in the determinant 
of the variance-covariance matrix of residuals due to that variable. 

Computationally, the models presented here result from entering indi- 
vidual explanatory variables in the order which reduces the current If] the 


1/ An index with minimum value equal to zero and maximum value equal to 1 
would be constructed by dividing all of these calculated values by 1.75. The 
value of 1.75 equals |1 - 1/8] + 7/0 - 1/8] = 7/8 + 7(1/8) = 14/8. 
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TABLE 10 


Estimated Coefficients, 1950 


Dilependent variables { 1 | dG 


Constant term and Income class intervala 
1 





coefficient estimates? 


bo 
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Constant term 
Urban index 
Real estate 
Education 
Tenure (managers) 
Trading center 
Road Class 1 
Isard index 
Age level 
Irrigated farms 
Farm size 


Constant term 
Urban index 
Real estate 
Education 
Tenure (managers) 
Trading center 
Road Class 1 
Isard index 
Age level 
Irrigated farms 
Farm size 


1 
~ 
OrFOAONOGOOOFOU® 


FOUNUONONOCOON 
FPrPNMNNMNWOUWODOfU 


See supra, Table 1, page 6, for income values included in each class interval. 


Number of decimal places varies between income class intervals because of a decision-rule to list data 
in terms of five significant digits without regard to location of decimal point. 
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TABLE 11 


Estimated Coefficients, 1960 


Constant term and Income class interval2 
independent variables cee ee Se “nes eee, ee a 


coefficient estimates? 

Constant term 

Education 

Real estate 

Farm size 

Trading center 

Road Class 1 

Age level 

Isard index 


Urban index 


Tenure (managers) 


Irrigated farms 


(Continued on next page.) 
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TABLE 11--continued. 


Constant term and Income class intervala 
independent variables [| = 7] BD) 
coefficient estimates? 


Constant term 
Education 

Real estate 

Farm size 
Trading center 
Road Class 1 

Age level 

Isard index 
Urban index 
Tenure (managers) 


Irrigated farms 





See supra, Table 1, page 6, for income values included in each class interval. 


Number of decimal places varies between income class intervals because of a decision-rule to list data 
in terms of five significant digits without regard to location of decimal point. 
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TABLE 12 


Residual Correlation Matrix, Phase 1, 1950 Model 


Income class 
interval and 
equation Income class interval and equation number 


number3/ 





(Continued on next page.) 
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equation 
number@. 


TABLE 12--continued. 
Income class 
interval and 
Income class interval and equation number 
a ee es  e ) ee | is | ee i ee 


Because of singularity, 
1, which is why the 





a/ There are 14 equations corresponding to income class in the model. 
Equation 1 is eliminated in phase 


one equation must be eliminated. 
sequence here runs from 2 through 14, 


TABLE 13 


Results Obtained in Testing the Hypothesis that Off-Diagonal Elements 
Are Zero in the Variance-Covariance Matrix of Residuals 
1950 and 1960 


Degrees of freedom Chi square values 


Phase 1 171.7704 


Phase 2 211.1496 


1960 
Phase 1 151.9075 


Phase 2 148.4943 





a/ In phase 1, column 1 of the Y matrix was excluded; in phase 2, 
column 2 of the Y matrix was excluded. This was done because of 
singularity in the original Y matrix. 
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maximum amount. Thus, in Table 14 each step indicates the amount of residual 
variation explained by introducing the best variable at that step. 

Thus, for the 1950 model the urban index is the first variable to enter, 
explaining 52 percent of the initial variance to be explained. It is followed 
by real estate, which explains 43 percent of the residual variance remaining 
after the urban index is introduced. This, in turn, is followed by education, 
which explains 39 percent of the variance remaining after the two preceding 
independent variables are introduced. For the 1960 model the first three 
variables in order of appearance are education, real estate, and farm size, 
respectively. 

A measure of all currently included variables' effect upon the unexplained 
variance is the percentage change in the current (¢)2/ relative to the determi- 
nant of the variance-covariance matrix about the hieass &* These values are 
presented in Table 15 for the 1950 and 1960 models, phase 1. This table, in 
effect, presents the cumulative percent reduction in unexplained variance. 

A statistical test of each model was carried out using the methodological 
approach to hypothesis testing defined earlier. The hypothesis was that all 
8 vectors contain coefficients equal to zero. 

Under this hypothesis, the U statistic might be considered as a multi- 
variate analogue of R? (in a univariate model, this U statistic is equal to 
1- R?). For multiple-equation models, the U statistic does not have exactly 
the same characteristics as 1 - R*® since we are considering the determinant 
values of matrices of unexplained variance. 


1/ "Currently included variables" and "current |f{|" refer to variables and 
determinant appearing in a specific step of the stepwise computational procedure. 


2/ The "variance-covariance matrix about the means" refers to the variances 
and covariances of the Y variates before regressions are run. 


a6es 
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TABLE 14 
Effects of Individual Explanatory Variables Upon a Measure of Generalized Unexplained Variance 


at Successive Computational Steps, Phase 1, 1950 and 1960 


1950 model, phase 12) 1960 model, phase 12 
Variables entering Variables entering 
at successive stepsb/ Percent changec/ at successive stepsb/ Percent changec/ 


Urban index ‘ Education 
Real estate : Real estate 


Education : Farm size 


Tenure (managers) : Trading center 


Trading center ‘ Road Class 1 
Road Class 1 ‘ Age level 

Isard index ; Isard index 

Age level , Urban index 
Irrigated farms : Tenure (managers) 


Farm size ‘: Irrigated farms 





a/ Phase 1 involves the elimination of the first dependent variable, Yy> because of singularity. 
b/ One variable enters at each step in the stepwise procedure. 


c/ The percent change figure indicates the amount of residual variance explained at each succeed- 


ing step. Thus, education (in 1950) explains 39 percent of the variance remaining after the 
introduction of population level and capital. Variables appear in order of importance in re- 
ducing total variance. 
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TABLE 15 


Cumulative Percent Explanation of Initial Generalized Unexplained Variance 
at Successive Computational Steps, Phase 1, 1950 and 1960 


1950 model, phase 12 1960 model, phase 12 


Variables entering Cumulative Variables entering Cumulative / 
at successive steps/ percent explainedo/ at successive stepal/ percent explained& 


Urban index F Education 

Real estate : Real estate 
Education ‘ Farm size 
Tenure (managers) 3 Trading center 
Trading center ; Road Class 1 
Road Class 1 : Age level 
Isard index ‘ Isard index 


Age level : Urban index 


Irrigated farms ‘ Tenure (managers) 


Farm size r Irrigated farms 





Phase 1 involves the elimination of the first dependent variable, Y because of singularity. 
One variable enters at each step in the stepwise procedure. 


This measure equals (1 - |{| current/|{| initial) 100 percent where "|}| initial" involves 


the variance-covariance matrix of the Y variates and aT! current" involves the estimated 
residual variance-covariance matrix at a given step in the stepwise procedure. 





These tests were carried out for phase 1 for both 1950 and 1960. The 
unadjusted U statistic under this hypothesis, with 13, 11, and 40 degrees of 
freedom for the 1950 model, is .00616. The adjusted statistic, Z, with 143 
degrees of freedom, is 201.0478. The calculated probability that a table Z 
value is greater than or equal to the calculated Z value is .00061. For the 
1960 model, the unadjusted U statistic, with 10, 11, and 40 degrees of freedom, 
is .01260. The adjusted statistic, with 110 degrees of freedom, is 179.3393. 
The corresponding probability is .00002. 

Not too surprisingly, then, the hypothesis that all coefficients equal 
zero was rather resoundingly rejected for all cases investigated. 

Hotelling's T? statistic can be employed to test the hypothesis that the 
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coefficients of a given independent variable are zero in all equations.— This 
test was carried out for each of the variables in turn. The T? statistics com- 
puted for both the 1950 and the 1960 cases appear in Table 16. For each case, 
Table 16 lists the probability of error if the zero value hypothesis is re- 
jected. For education, Road Class 1, trading center, and real estate, this 
probability is quite low, indicating the explanatory power of these variables. 
For age level and irrigation, the probability is relatively high, indicating 
that these are relatively weak explanatory variables. Tenure, farm size, and 
the urban index have a low probability in one year, but a relatively high proba- 
bility in the other, and can be viewed as mixed cases. 

Stress was not placed on tests of individual coefficients, given the focus 
of this study. However, t values for all variables and equations were computed 
and are presented in Appendix C. 


1/ For an explicit definition of Hotelling's T* statistic in matrix terms, 
see Henry Scheffe, The Analysis of Variance (New York: John Wiley & Sons, 
Ine., 1959), p. 418. 
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TABLE 16 


Calculated Values of Hotelling T* Statistic and Error Probability 
in Testing for Zero Coefficients, 1950 and 1960 


1950 1960 
Independent Hotelling T? Error ,,| Hotelling [* Error.) 
variable statistic probabilit statistic probabilit 


Education 45.39% ; VS 


Road Class 1 74.87% : 32.30* 
Tenure (managers) 38.12* ‘ L541 
Age level 27:83 ‘ 16.63 
Trading center 50.32* c 38. 
Irrigated farms 24.84 “ 12. 
Farm size 22.43 5 40. 
Real estate Say : 46. 
Urban index Dis : 20. 


Isard index 30. } 26. 





a/ Probability of error if the null hypothesis is rejected. The null hypothesis 
states that the variable has zero coefficients in all equations. 


* Statistically significant at the 5 percent level. 
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Interpretations of Results Obtained 

The estimated coefficients obtained are plotted against the midpoints of 
their corresponding income class intervals in Figures 6 through 15. For a 
particular variable of a given model, the relation between the estimated co- 
efficients and income level is shown for both 1950 and 1960. Thus, each graph 
summarizes the results for that variable in 25 equations (14 for 1950 and 11 
for 1960). 

Coefficients are indicated as points. Straight-line approximations are 
drawn between successive points; 1960 coefficients are plotted against appro- 
priately deflated values. Since 1950 has 14 classes and 1960 has 11 classes, 
there will be some lack of correspondence in the comparisons. Further, the 
plot results are not adjusted for different class sizes. This lack of adjust- 
ment for class size is defensible in terms of exhibiting coefficient values 
actually obtained. However, for a comparison between classes of the impact of 
a unit change in an independent variable, it would be sensible to take differ- 
ing class sizes into account. To put this in simple terms, say a particular 
coefficient is .1 for income Class A and .3 for income Class B; then, an in- 
crease of 10 units in the independent variable would lead to a predicted in- 
crease of one income unit in A and three income units in B. However, if it 
were further specified that the Class B interval is three times as large as 
that of A, the increase in income units per thousand dollars spanned by the 
interval will be the same for B as for A. This effect should be kept in mind 
in interpreting eas 

In interpreting results, the following general considerations apply. A 
variable with a positive coefficient increases the frequency of occurrence in 


1/ The open-end income interval, in an adjusted graphing, would have to be 
treated as a separate category. 





Urban index coefficients 
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FIGURE 6. 
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Plot of Estimated Coefficients for Urban Index on 
Midpoints of Income Classes, 1950 and 1960 
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FIGURE 7. Plot of Estimated Coefficients for Farm Real Estate 


on Midpoints of Income Classes, 1950 and 1960 
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FIGURE 8. Plot of Estimated Coefficients for Average Education 
on Midpoints of Income Classes, 1950 and 1960 
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FIGURE 9. Plot of Estimated Coefficients for Tenure (Proportion 


of Hired Managers ) on Midpoints of Income Classes 


1950 and 1960 
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FIGURE 10. Plot of Estimated Coefficients for Trading Center 
(Distance to Urban Settlement) on Midpoints of 
Income Classes, 1950 and 1960 
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FIGURE 11. Plot of Estimated Coefficients for the Proportion of 
Road Class 1 on Midpoints of Income Classes 


1950 and 1960 
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FIGURE 12. Plot of Estimated Coefficients for the Isard Index 
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FIGURE 13. Plot of Estimated Coefficients for Average Age on 
Midpoints of Income Classes, 1950 and 1960 
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FIGURE 14. Plot of Estimated Coefficients for the Proportion 


of Farms Irrigated on Midpoints of Income Classes 


1950 and 1960 
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FIGURE 15. Plot of Estimated Coefficients for Farm Size on 
Midpoints of Income Classes, 1950 and 1960 
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the given income class, and a negative coefficient decreases the frequency of 
occurrence. If a variable exhibits an increasing coefficient as income in- 
creases, moving from negative to positive values, then this indicates that 
increasing the value of the explanatory variable will decrease the number of 
low-income recipients and increase the number of high-income recipients. The 
reverse relation occurs, of course, if the coefficient decreases as income in- 
creases. Other patterns are of interest. Thus, a coefficient may be negative 
at the extremes of the income distribution and positive in the middle income 
classes. An increase in the associated independent variable yields greater 
income equality, though it may have little impact on the average level of 
income. (This information can be viewed as one of the advantages of working 
with a distribution rather than with averages.) 

Implicit in this discussion is a general feature of the coefficient re- 
sults which is of considerable interest. In Tables 10 and 11, the sum of the 
constant term over income classes is extremely close to 100, corresponding to 
a total frequency of 100 percent. The sum of the coefficients of each depend- 


ent variable over income classes is essentially zero. No doubt this reflects 
the very large amount of explained variance as shown in Table roped 

A consequence of this feature is that a change in an independent variable 
yields a new income distribution with income class frequencies summing to the 
proper 100 percent. A further consequence is that each independent variable 
has at least one positive and one negative coefficient over the set of all in- 
come classes. Finally, coefficients for a subset of income intervals can be 


1/ A similar result occurred in some preliminary work using number of income 
recipients, rather than frequencies, as dependent variables. In this case, the 
total number of income recipients appeared as one of the independent variables; 
results involved essentially a zero sum of coefficients for the other independ- 
ents, a sum of 1.00 for the coefficients of total recipients, and a sum of zero 
for the constant term. 
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aggregated so the impact of a change in an independent variable can be viewed 
in broad terms. For example, in the 1950 case, an increase of one year in 
the education level has the following impact, based on Table 10 data. There 
is a drop of 4.8 percent in income recipients receiving $2,000 or less; an 
increase of 2.5 percent in the $2,000-$5,000 class; and an increase of 2.3 
percent in the over-$5,000 class. 

Turning to results for individual variables, the coefficient patterns for 
the urban index, real estate, education, and trading center seem to emerge 
rather clearly and fit expectations rather well. Coefficients for the latter 
variable decline as income increases. This is reasonable, for an increase in 
distance to trading center leads to an increase in low-income recipients, a 
decrease in high-income recipients, and, hence, a decline in income overall. 
For the other variables listed above, there is a general upward movement in 
coefficient value as income increases, indicating a positive shift in income 
distributions as these variables increase. The positive impact of urbanization 
adds some further support to the location hypothesis of T. W. Sclindeec 

The education coefficients show a clearly increasing pattern in both 1950 
and 1960. However, some shift in pattern appears involved, with the 1960 edu- 
cation impact seeming somewhat more pronounced. Such results appear meaningful 
in considering education as an income shifter. One of the difficulties with 
prior measures of the impact of education is that there is no guarantee that 
the underlying population is homogeneous. Thus, it may be found that income 
increases with education; however, if there are differences in innate ability 
involved, both increases may reflect this latter factor. In the present study, 


however, this problem seems much less likely to occur. It seems plausible to 
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argue that there is not much difference in average level of innate ability be- 
tween counties. Hence, the impact of the education variable can be attributed 
plausibly to the education process with much less concern that differences in 
underlying ability are involved. Of course, average education level involves 
an underlying distribution; and, hence, some improvement for both analysis and 
policy purposes can be expected by disaggregation. Thus, in future work the 
education variable might be replaced by a set of education activities, in- 
cluding primary, secondary, and college education; level of extension work; 
and measures of "quality" of education. 

The real estate coefficients exhibit an upward movement with income, 
though the 1950 estimates show rather pronounced variability. Results for 
farm size show a different pattern. There is little trend in value of coef- 
ficient as income increases. This is the case for both 1950 and 1960, with 
one exception occurring in the 1960 results; here, there is a pronounced drop 
in coefficient value for the highest income class. It turns out there is 
little correlation between real estate, measured in dollars per farm, and farm 
size, measured in acres per farm. (This correlation was about .15 in both 
1950 and 1960.) It is likely that very large acreages are associated with 
grazing operations so that the farm size variable may be an index of grazing; 
such an argument might be applicable in explaining results for this variable. 

Results for Road Class 1 indicate that good roads lead to greater equality 
of income. Thea, for both 1950 and 1960, increasing the level of this variable 
reduces income recipients in both the lowest three and the highest three income 
classes. In 1950 the reductions are 11 and 6 percent, respectively; while in 
1960, the respective reductions are 7 and 15 percent. Before road building is 
recommended seriously as an egalitarian device, however, it is worth noting 


that this variable shows a fair amount of negative correlation with distance to 
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trading center--which seems reasonable. The correlation between these variables 
was -.5 in 1950 and -.6 in 1960. The building of good roads will tend to reduce 
distance to trading center, both in terms of over-the-road mileage and by pos- 
sible generation of new trading centers. Hence, it might be best to consider 
these variables jointly in interpreting results. 

Results for the remaining four variables are not particularly enlighten- 
ing. The variable labeled "tenure" is a measure of the proportion of hired 
managers. For 1950 the coefficients exhibit some decline as income increases, 
but little pattern appears for 1960. The Isard index, which measures speciali- 
zation, shows a great deal of variability and little pattern in either 1950 or 
1960. The age variable shows little pattern for 1950 but in 1960 exhibits a 
marked declining trend for upper income groups, indicating some income decline 
with increasing age. Future work might allow for a possible nonlinear relation 
between age and income; this could be done by using several age variables or by 
introducing the squared value of age minus a constant. Finally, the irrigated 
farm variable, measured as a fraction of all farms, shows little in the way of 
pattern for both 1950 and 1960. Future work might experiment with alternative 
measures, such as irrigated acreage as a fraction of all acreage. 

Overall, there is a good deal of similarity in the pattern of behavior of 
a given coefficient for 1950 and 1960. Most of the variables exhibit roughly 
the same graphic relationships in the two periods. However, though patterns 
seem intuitively similar, evidence to be presented below indicates some changes 
in structure may have taken place between 1950 and 1960. 

The evidence emerges as a by-product of a specific application. The re- 
sults developed here can be applied in forecasting income distributions if 
(1) forecasts of explanatory variables are made and (2) adjustments for price 


level changes and underreporting are carried out. 


a ce 


By way of example, forecasts were developed for the distribution of in- 
come for rural farm families in Fresno County. Fresno County was selected be- 
cause it is the largest county in terms of aggregate rural farm income. The 
1950 equations were used to develop Fresno County forecasts for 1950 and 1960. 
The 1950 "forecast" can be viewed as indicative of the goodness of fit of re- 
sults for a specific county in 1950. 

Forecasts presented here are in terms of the 1950 income class intervals. 
Table 17 compares the 1950 forecast to 1950 actual for each income class. 

On an intuitive level, forecasts seem fairly close to actual levels. 
Thus, 8 of the 14 forecasts are within 10 percent of actual, 3 are between 5 
and 10 percent, and 3 are above 10 percent. Rather large discrepancies occur 
for the extreme income classes (the lowest and highest). 

Comparison of 1960 forecasts to 1960 actual values is complicated by 
underreporting of income in 1950 relative to 1960 and by increases in the 
general level of prices. These problems were handled by adjusting income 
intervals and then redistributing number (or frequency) of income recipients 
to the income classes defined for 1960. In particular, available evidence 
indicated a price inflation factor of 1.223 and a general income underreport-— 
ing factor of 1.1, with greater adjustment occurring for lower income = peer 

Results obtained appear in Table 18, which compares 1960 forecasts to 
1960 actual values for Fresno County. It turns out that forecasts are well 
above actual values for lower income classes and well below actual values for 


1/ The price inflation factor was obtained from the consumer price index 
for 1959 relative to 1949; the income underreporting factors from Irving Hoch, 
Income and Taxes (Chicago, Illinois: Chicago Area Transportation Study, 1958), 
p- 31. The approach followed involved the assumption that adjustments were 
uniform within an income class; as a consequence, class intervals were modi- 
fied. Thus, the 1950 income class interval, $1,000-$2,000, was transformed to 
an interval of $1,223-$2,446 on the basis of the price inflation factor of 
1.223. 
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TABLE 17 


Forecasts of Income Distribution Using the 1950 Coefficients 
Fresno County, 1950 and 1960 


Interval: Range 
from lower value Frequency of rural farm families 
to less than falling in income class2 


upper value 1950 actual 1950 forecast 1960 forecast 
thousand dollars 
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a/ Frequency, in terms of proportions. 


1950 forecast 
divided by 
1950 actual 








=i) = 


TABLE 18 


Comparison of 1960 Forecasts (Adjusted for Underreporting) 
to 1960 Actual Income Distribution, Fresno County 


Income class 


Interval: Range 

































from lower value | Frequency of rural farm ey ied Forecast 
to less than falling in income class2 divided by 


upper value actual 


thousand dollars 









1 1 
2 2 
3 3 
4 4 
5 5 
6 6 
7 yu 
8 8 
2) 9 
10 9-10 
11 10+ 





a/ Frequency, in terms of proportions. 





upper income classes. The magnitude of forecast error is about 25 percent, 
given an aggregation into high- and low-income classes. These results suggest 


a change in structure occurred between 1950 and 1960. 
Conclusion 


The first major section of this report examined patterns of rural farm 
income on a county basis. At the methodological level, one of the techniques 
employed was the classification and mapping of income levels by means of a com- 
puter program. It seems likely that there will be considerable growth in the 
use of such computer mapping techniques in regional analysis. In terms of 
substantive results, there is some support for the following conclusions. For 
the state of California, no marked difference in income level occurs between 
farm and nonfarm sectors at the county level. Urbanization is associated with 
increased income levels for both nonfarm and farm populations. Some reduction 
of income differentials appears to be occurring at the county level; that is, 
counties with lowest average income tend to have the highest rate of growth in 
average income. Nevertheless, marked income differentials persist over time. 
For the farm sector, seven counties had high average incomes and five counties 
had low average incomes in both 1950 and 1960, using a high, medium, and low 
classification. In the nonfarm sector, 9 counties were in the high classifi- 
cation and 11 were in the low classification in both periods. Thus, the "income 
problem," in terms of pronounced differences in average level between groups, 
seems more of a regional than a sectorial problem in California. Finally, there 
appear to be regional clusters of relatively equal income growth. For both 
farm and nonfarm sectors, southern California, the southern coast, and a tier 
of counties north of Sacramento County increased markedly in average income 


rank. At the other extreme, San Joaquin Valley and northern California counties 


declined substantially in average income rank. 


i 


The work of the second section can be viewed as an attempt to uncover 
some of the underlying causative factors explaining the income differentials 
discussed here. But beyond that kind of work, deeper probing and alternative 
explanatory frameworks could be considered. This could involve such matters 
as the persistence of income differences over long periods, their relation to 
migration and population growth, and the effectiveness of policies aimed at 
reducing income differentials. 

On the methodological level, the second major section of this report 
applied multivariate analysis to the study of income distributions. For each 
income class, the frequency in that class was related to a common set of ex- 
planatory variables. Good substantive results were obtained in terms of ex- 
plained variance and coefficient estimates for a number of the explanatory 
variables. In particular, results for variables—-education, real estate, 
urbanization, and trading center--seemed reasonable and in line with expecta- 
tions, A noteworthy aspect of the estimates is that, for each independent 
variable, the coefficient sum over income classes totals zero, while the sum 
of constants totals 100. As a consequence, any change in the levels of explana- 
tory variables leaves total frequency unaffected. This is a useful feature both 
in terms of viewing the impact of a unit change in one independent variable and 
in terms of obtaining new distributions by changing levels of all independent 
variables, given estimates of those variables. Estimating levels of independent 
variables is a feature of two important applications: (1) obtaining forecasts 
for specific counties (as exemplified here in the case of Fresno County) and 
(2) estimating income distributions for areas smaller than counties. The latter 
application may have commercial possibilities. Thus, consider a firm marketing 
a consumer product with a high income elasticity of demand; information on small 


region income distributions should be of help in its sales and marketing strategy. 
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Some avenues of future work may be noted. First, a number of new inde- 
pendent variables might be introduced. In particular, some suggested variations 
for education, age, and irrigation were noted above. Second, attempts can be 
made to investigate the underlying structural relationships which are roughly 
indicated by the figures plotting estimated coefficients against income class 
midpoints. An attempt might be made to develop smooth curves for the relation- 
ships and, along with this, to test the hypothesis that a change in structure 
has occurred between time periods. The results obtained for the Fresno fore- 
cast indicate that such a change has in fact occurred. It may be that part of 
the discrepancy involves a statistical problem stemming from the impact of tran- 
sitory (versus permanent) components of income. But, in any event, it seems 
plausible that some underlying changes in structural relationships have in fact 
occurred. This is suggested, in particular, by the shift in pattern noted for 
education between 1950 and 1960. Finally, there are intimations, at least, that 
the analysis developed here can be developed further for use as a policy tool. 
The explanatory variables would then be viewed as policy instruments. In parti- 
sates, they might be used to achieve not only higher levels of average income 
but a "better" distribution of income, presumably reflecting the views of the 


electorate. 
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APPENDIX A 
Problems and Solutions in the Use of Census Income Data 
—— ee ae worurions in the Use or Census Income Data 


A number of problems arise in the use of Census of Population income 
data in economic analysis. Problems handled in this study include (1) the 
distribution of nonresponse units to make 1950 data comparable to 1960 and 
(2) the development of average income estimates accounting for skewness in 
the distributions. Other problems noted in this appendix are (1) the prob- 
lem of differences in U. S. Office of Business Economics and Census measures 
of income and (2) the problem of comparability of family income versus family 
and unrelated income. The discussion of these problems can be of relevance 


in applications of results presented in the body of this report. 


Nonresponse 
An important difference between 1950 and 1960 Census income data is in 

the handling of nonrespondents. Of the sample interviewed, the nonrespondents 
for income data are those units for which some characteristics were recorded 
but income data were not obtained. Nonresponse occurred either because the 
interviewee did not have adequate information or because he refused to divulge 
Lt al 

The 1950 Census listed the number of nonresponse units as a separate cate- 
gory. In the 1960 Census, units in this category "were assigned the reported 
income of persons with similar demographic and economic seavacketeeice 1<* 


1/ Personal correspondence with Henry S. Shryock, Jr., Acting Chief, Popu- 
lation Division, U. S. Bureau of the Census, March 12, 1965. 


2/ Ibia. 
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The 1960 Census used the following characteristics for distributing the nonre- 
spondents: age, sex, family status, color, weeks worked, and major occupation 
ee 

To increase comparability between the 1950 and 1960 income distribution 
data, a procedure was developed for distributing the 1950 income nonrespondents 
in a manner similar to that used by the Census in 1960. 

Unpublished data from the Bureau of the Census were employed in the dis- 
tribution Dubeedaie =! A portion of this data is contained in Appendix Table A-l. 
This table shows the 1960 income distribution of male persons (14 years old and 
over) before and after distribution of nonrespondents for the state of Cali- 
fornia. The income distribution of male nonrespondents, as allocated by the 
1960 Census procedure, is shown. In addition, the ratio of the number "after" 
to the number "before" for each income class is shown. 


The procedure for distributing nonrespondents consisted of the following 


steps: 


1. A matrix of "weights" for the adjustment of each income matrix 
was computed. (Counties correspond to rows and income classes 
to columns.) 

2s ‘These weights were multiplied by the observed number of units 

° 
in each income class interval for each county. 

3. The above products were summed over income classes of the 
respective county. The resulting sum was then compared to the 
actual sum of observation units for that county. 


1/ For a brief description of the 1960 distributing of income wimg oe 
see U. S$. Bureau of the Census, U. S. Census of Population: 1960 ig 1965, 
pp. XXVIII and XXIx. 


2/ Personal correspondence with Henry §. Shryock, Jr. 


Fin 


APPENDIX TABLE A-1 


Male Income Distribution Before and After 
Distribution of Nonrespondents, 1960 


Income class 


Interval: Range Before After 

from lower value distribution | distribution Ratio of 
to less than Nonre- of nonre- of nonre- after to 
upper value spondents pondents before 


thousand dollars 


49,573 504,330 553,903 
42,300 528,895 571,195 
31,404 385,246 416,650 
32,740 424,875 457,615 
37,362 517,745 555,107 
38,538 618,267 656,805 
31,110 544,885 575,995 
41,175 763,505 804,680 
26,588 454,396 480,984 


330,790 4,742,144 5,072,934 





Source: Unpublished data obtained by personal correspondence with Henry S. 
Shryock, Jr., Acting Chief, Population Division, U. S. Bureau of the Census, 
March 12, 1965. 
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4. When the two sums were not equal, each of the adjusted number 
of units in each income interval for each county were multiplied 


by the ratio of the actual sum to the computed sum. 


The result for each county of the above four adjustment steps is to dis- 
tribute the 1950 income nonrespondents over that county's 1950 income distri- 
bution in a manner similar to that used by the 1960 Census. Consequently, 
the sum of the resulting distribution is equal to that of the county's income 
respondents plus nonrespondents. 

To restate the procedure in algebraic terms, the number of observations 


in a given income class interval for a given county is multiplied by a scale 


factor: 
Wepij 
where 
s = sector 
p = income unit (either family or family and unrelated) 
i = county 
and 
j = income class interval. 


Then, W_.. is defined as follows: 
splj 


Oi 





where 


R, = the state of California ratio of number of observations after 
distributing nonrespondents to the number of observations be- 
fore distributing nonrespondents by income class, j. (Numeri- 
cal values of a, appear as the last column of Appendix Table A-1.) 


R, = the ratio of after to before of total California observations 
(and equals 1.070 as shown in Appendix Table A-1) 


and 


= the ratio of total county, i, observations including nonrespond- 
ents to the total excluding nonrespondents for specific sector, 
s, and income unit, p. 


Rf 3 
t, spi 


Estimating Average Income for Skewed Income Distributions 


Given the adjusted observations on county income distributions, average 
income estimates by county and sector were then developed. 

Average income is often computed by cumulative summing of income units 
times midpoints of class intervals and dividing by total units. However, since 
income distributions are usually positively skewed, this approach was not used 
here. Instead, average income was estimated as a function of the midpoint 
estimate and of a measure of skewness computed under the midpoint asvnneton A” 
In the application of this function, the midpoint estimate and skewness measure 


were developed for each distribution of this study. The coefficients of the 


1/ Alternative approaches were considered but posed difficulties. The 
skewed distribution is often transformed to approach a symmetric distribution. 
The usual transformation procedure for income distributions employs the log 
transformation. This is correct when X is the positive population variable 
and Log (X) = Z is in fact normally distributed. This approach is convenient 
and easily adopted. However, it is not appropriate in transforming the vari- 
ous forms of income distributions which appear on a sectorial breakdown. 


An alternative transformation technique is to employ the Sech square 
distribution function. This approach can be applied to a wide range of dis- 
tribution forms. Although application is of a simple nature, a significant 
amount of decision-making time is required in a single application. See 
Peter R. Fisk, "The Graduation of Income Distributions," Econometrica, Vol. 29, 
No. 2 (April, 1961), pp. 171-185. 
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function were given, having been estimated previously using United States 

data. Three forms of the function were employed, with a separate set of co- 
efficients given for farm operator families, nonfarm families, and all families. 
(The results for farm operator families were assumed to be applicable to the 
rural farm family units hove} 

In estimating the functions to be employed, data used were United States 
family data collected and published by the Office of Business Economics (OBE) .2/ 
A nine-year time series was available on three sectors. (This included the 
years 1950, 1951, and 1953 through 1959.) As indicated above, the data per- 
tained to farm operator families, nonfarm families, and all families as defined 
by the OBE and assumed applicable here. Value deflation was carried out, using 
the consumer price index. Differences do exist between the OBE definitions of 
sectors and observation units and the Census definitions. But this was not 
considered to be of critical importance for the problem at hand. The most ob- 
vious assumption involved is that aggregated United States sectorial data are 
roughly appropriate for correction of moments computed from California sec- 


torial data. 


It was hypothesized that the following relations held: 


WW 
> 
° 
+ 
Po 
re 
= 
icp) 
VY 


wn 
i 
wo 
+ 
w 
= 
wn 
VY 


1/ For the open-end upper income classes, it was assumed that the class 
average for the "$10,000 and over" class was $20,000 and that the class aver- 
age for the "$25,000 and over" class was $44,000. (The former applies to 1950 
and 1960 sectors, and the latter applies to 1960 All.) These figures were 
based on application of the Pareto distribution. Cf., Herman P. Miller, Trends 
in the Income of Families and Persons in the United States: 1947 to 1960, U. S. 
Bureau of the Census Technical Paper No. 8, 1963, p. 16. 


2/ U. S. Department of Commerce, Office of Business Economics, Survey of 
Current Business, for the years 1950, 1951, and 1953 through 1959. 
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and by substitution 


Yaad 
Ye ae 
ai 
where 
Yad = the first moment (or mean income estimate) computed under 
the midpoint assumption 
Yai = the first moment computed using each interval's average 
income 
ae = the third deviation moment computed using each interval's 
average income 
and 
Si = the third deviation moment computed under the midpoint 


assumption. 


For purposes of estimation, both sides of the equation are multiplied by 


Yi43 and a disturbance term, UL, is introduced to yield: 


ad z Sat ® oa (Sip Wap “ Ty 


Results obtained for the three cases (farm operator family, nonfarm family, 
and all family) are listed in Appendix Table A-2. 

To measure the "goodness of fit," the coefficient of multiple determination 
was computed for each case. For all three cases, this coefficient was at least 
-99, The standard error of the dependent variable was small in each of the three 
cases. For the farm operator family, nonfarm family, and all family cases, the 
standard errors were $6.62, $3.52, and $3.75, respectively. 


Given the estimates of C. and C, from the fitted equation, averages for 


if 


particular income distributions were obtained using: 
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APPENDIX TABLE A-2 


Estimated Coefficients for Average Family Income as a 
Function of Midpoint Estimate and Skewness 


Estimated coefficients 


Midpoint estimate 
times skewness measure 
c_2 

1 
















Midpoint estimate 
¢ af 
° 








sectors 


b/ 


Farm operator— 


Famil 








1.0435 io 





LS 


Nonfarm 0.9140 10° 







15 






All 7.9020 10° 


a/ Estimates are indicated by the hat (%). 


b/ The results for farm operator families were assumed to be applicable to 
the rural farm family units in this study. 


Sources: 


U. S. Department of Commerce, Office of Business Economics, Survey of 
Current Business, Vol. 43, No. 7 (July, 1963), p. 22. 


Idem, U. S. Income and Output: A Supplement to the Survey of Current 
Business, 1958, p. 161. 


Idem, Income Distribution in the United States: A Supplement to the 
Survey of Current Business, 1953, p. 85. 
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(Co + C)S4) 

and then inserting values of Yad and Si obtained for the particular distribution. 
For the state as a whole, midpoint estimates were fairly close to the ad- 

justed averages estimated here. Thus, for the family and unrelated unit in 1950, 

the ratio of the adjusted average to the midpoint average was .986 for the nonfarm 

sector and 1.065 for the farm sector. On a county level, however, more pronounced 


differences occurred. 


Underreporting of Income and Imputed Income 


Census data differs from OBE data on income; the latter include the familiar 
personal income and national income series issued by the federal government. 

Appendix Table A-3 compares aggregate income estimates for the state ob- 
tained from the 1950 and 1960 Census data to the OBE data for the years 1949 
and 1959, respaetively.=! 

It turns out that the ratio of this study's estimated aggregate income to 
that of the OBE is .80 in 1950 and .90 in 1960. Three factors account for this 
difference: (1) imputed income for owned housing and homegrown food does not 
appear in the Census figures but does appear in the OBE figures; (2) the value 
of services of banks and the income of persons who died or emigrated appears 
in the OBE but not in the Census figures; and (3) some income is underreported 
in the Census data, particularly dividend and interest income. Estimates for 
1950 of the impact of each of these factors in terms of understatement of OBE 
figures consist of the following: .949 for omission of imputed income, .956 
for exclusion of bank services and income of persons who died or emigrated, 


i/ Ibid., Vol. 45, No. 7 (July, 1965), p. 10. 
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APPENDIX TABLE A-3 


Comparison of Aggregate Family Income Estimates, 1950 and 1960 


ne ae 19604 
million dollars 





Census 
Midpoint estimates 


Present study 
averages 


Office of Business 
Economics 


Present study averages 
relative to Office of 
Business Economics 


a/ Year refers to date of Census. Dates of income received are 
1949 and 1959, respectively. 


b/ Aggregate figures were obtained by multiplying average estimates 
for All units by total number of income units as listed in the 
Census of Population. _ 


c/ Estimated by multiplying midpoint estimate times .998, the ratio 
of this study estimates to midpoint estimates for 1960 All family 


income. 


Sources: 


U. S. Bureau of the Census, U. S. Census of Population: 1950. 


Vol. Il, Characteristics of the Population, Part 5, California, 
1952, Dp. O-71. 


Idem, U. S. Census of Population: 1960. Vol. I, Characteristics 
of the Population, Part 6, California, 1963, p. 6-250. 


U. S. Office of Business Economics, Survey of Current Business, 
Vol. 45, Nox 1 (July, 1965), p. 10. 
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and .885 reflecting underreporting of income. The product of these fractions 
is .803, approximately that obtained as the overall ratio here. The increase 
of the ratio to .90 in 1959 probably reflects less duputed income from home- 
grown food and increased accuracy of measurement of income from dividends and 
interest. 

If income figures corresponding to OBE measures are desired, the ratios 


presented in Appendix Table A-3 can be used as adjustment factors. 


Family Income Versus Family and Unrelated Income 


Comparison of farm and nonfarm sectors using family income does not neces- 
sarily correspond to a comparison of sectors using family and unrelated income. 
This is because the relative proportions of families and unrelated individuals 
differ between the rural farm and the nonfarm sectors. The average income of 
unrelated individuals is well below that of families so that comparisons of farm 
and nonfarm income are affected by choice of income unit. Appendix Table A-4 
exhibits the fraction of families and of unrelated individuals in the family and 


unrelated category by sector for the 1950 and 1960 Census. 


1/ Hoch, op. cit. pp. 7, 30, and 31. 
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APPENDIX TABLE A-4 


Fraction of Families and Fraction of Unrelated Individuals in the Family 
and Unrelated Unit by Sectors, 1950 and 1960 


and unrelated 
Number of families Number of unrelated 
as fraction of individuals as fraction 
families and unrelated of families and un- 
Year and sectors individuals total related individuals total 


1950 

All 

Urban 

Rural nonfarm 


Rural farm 


1960 

All 

Urban 

Rural nonfarm 


Rural farm 





Sources: 


U. §. Bureau of the Census, U. S. Census of Population: 1950. Vol. II, 
Characteristics of the Population, Part 5, California, 1952, p. S-7L. 


Idem, U. S. Census of Population: 1960. Vol. I, Characteristics of the 
Population, Part 6, California, 1963, p. 6-250. 
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APPENDIX B 


County Ordering and County Income Rank 


County Ordering 


Counties were ordered on the basis of location running from north to south 
and are listed in the text tables on the basis of that order. Figure B-1 exhi- 
bits a map of California counties coded in the location order employed here and 


presents an alphabetical list of counties with corresponding location code number. 


County Rank by Average Income 


The average income figures of Tables 2 and 3 were used to develop county 
rankings. Appendix Table B-1 presents nonfarm and rural farm rankings for 1950 
and 1960 and then exhibits the change in rankings defined as the 1950 rank minus 
the 1960 rank. Data limitations pose something of a problem. Thus, the 1950 
data is for family and unrelated income while the 1960 data is for family in- 
come, Again, some counties appearing in 1950 are omitted in 1960. For the 1950 
rural farm sector (using family and unrelated data), the first five counties in 
rank order were Alpine, Ventura, Colusa, Orange, and Glenn. For 1960 (using 
family data), the first five in rank order were Orange, San Mateo, Ventura, Del 
Norte, and Los Angeles. The lowest five counties for 1950 were Placer, Lake, 
Del Norte, Imperial, and Trinity, with Trinity the lowest. In 1960 the lowest 
five counties (of those reported) were Stanislaus, Tuolumne, Mariposa, Cala- 
veras, and Mono. 

Rural farm income tends to be more variable than nonfarm income so that 
greater change in rankings can be expected and in fact occurs. Such change 
is particularly pronounced for counties with small populations, for example, 

Del Norte and Alpine counties. 
The information in Appendix Table B-1 is the source for Figures 4 and 5 


presented in the text. 
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Location Location 






































code code 
County number County . number 
Alameda 35 Lake 18 
Alpine ail Lassen 7 
Amador 29 Los Angeles 54 
Butte 12 Madera 42 
Calaveras 30 Marin 26 
Colusa 17 Mariposa 4l 
Contra Costa 24 Mendocino 10 
Del Norte 3 Merced 40 
El Dorado 23 Modoc 1 
Fresno 44 Mono 32 
MENDOCINO Glenn Alt Monterey 46 
Humboldt 4 Napa 20 
Imperial 5/7 Nevada 14 
Inyo 43 Orange 55 
Kern 50 Placer 22 
Kings 47 Plumas 8 
Riverside 56 
Sacramento 24 
San Benito 45 
San Bernardino 49 
San Diego 58 
San Francisco 36 
36 San Joaquin 28 
ape San Luis Obispo 51 
San Mateo 37. 
Santa Barbara DZ 
Santa Clara 39 


Santa Cruz 38 


Location 


code 
County number 8AN BERNARDINO 
Shasta 6 
Sierra 13 
Siskiyou 2 
Solano 25 
Sonoma 19 
Stanislaus 34 
Sutter 16 
Tehama 9 
Trinity 5: 
Tulare 48 
Tuolumne 33 
Ventura 53 BAN DIEGO 
Yolo 21, 
Yuba 1 


FIGURE B-1. Alphabetical List of Counties with Corresponding Location Code Numbers 


ajc 


APPENDIX TABLE B-1 


County Rankings of Average Income and Changes in Ranks Over Time 
1950 and 1960 


|_+ssssRankings—ss=S=S*~*~S*~drChange in ramkinge 
1950 family and 1950 rank minus 
unrelated income | 1960 family income 1960 rank 
Nonfarm farm Nonfarm farm Nonfarm farm 


21. 










Count 
















Modoc 36 
43 
19 
16 
18 
25 
81 
28 
46 
29 
50 





Siskiyou 






Del Norte 
Humboldt 















Trinity 






Shasta 









Lassen 








Plumas 








Tehama 








Mendocino 








Glenn 







Butte 48 
Sierra 4O 
Nevada 44 














Yuba 
Sutter 


SL 
30 
38 
56 
33 
15 















Colusa 
Lake 


Sonoma 




















Napa 
Yolo 








Placer 
El Dorado 














Sacramento 






Solano 






Marin 






Contra Costa 






San Joaquin 


(Continued on next page.) 
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APPENDIX TABLE B-1--continued. 







[—Ssté‘“‘(SW.CCCRankings”—~—Ct—~“‘(‘CSCCC4iChaangee in rankings | 
unrelated income |1960 family income 1960 rank 
Pa tee ee 
Nonfarm farm Nonfarm farm Nonfarm farm 
1 3 


a 
40 





















Amador 5 


Calaveras 





Alpine 





Mono 


Tuolumne 







Stanislaus 


Alameda 






San Francisco 











San Mateo 





Santa Cruz 


Santa Clara 





Merced 





Mariposa 





Madera 









Inyo 





Fresno 


San Benito 





Monterey 







Kings 


Tulare 










San Bernardino 





Kern 


San Luis Obispo 





Santa Barbara 





Ventura 










Los Angeles 
Orange 
Riverside 
Imperial 
San Diego 


(Continued on next page.) 
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APPENDIX TABLE B-1--continued. 


a/ No data available. 


b/ At least one year not available. This was more pronounced for 1960, so there 
is some tendency for an upward shift in rankings in 1960. Ranks appearing in 
1950 but not in 1960 are: 53, 54, 55, 56, and 57. Some counties do not 
appear in 1960. Their ranks in 1950 were 1, 14, 49, 55, and 57. 


c/ Not applicable. 


Sources: Developed from data in U. S. Bureau of the Census. U. §. Census of 
Population: 1950. 


Vol. II, Characteristics of the Po ulation, Part 5, 


California, 1952, p. 5-71; and idem, U. S. Census of Population: 1960. 
Vol. I, Characteristics of the Population, Part 6, California, 1963, p. 6-250 


<4. 





APPENDIX C 
Some Univariate Results for the Income Determination Model 


In the discussion of the multivariate model of factors determining the 
income distribution, there was a focus on the set of equations involved, viewed 
as a system. For example, there were statistical tests of hypothesized zero 
coefficients for a given independent variable in all equations. Some univari- 
ate results are presented here, both for general interest and for possible 


contrasts and comparisons. 


Calculated t Statistics 

Appendix Tables C-1 and C-2 present calculated t statistics for the 1950 
and 1960 equations, respectively. (Though these test statistics are applicable 
in multivariate models, they involve the consideration of one variable at a time 
in a particular equation. Because this differs from the focus adopted in the 
text, it seemed proper to catalogue these statistics as univariate results.) 
The number of t values significant at the 5 percent level ranges from a low of 
two for irrigation and for tenure and to a high of nine for education, viewing 
the 1950 and 1960 results in combination. All variables except irrigation and 
tenure have at least five significant coefficients. Given a true coefficient 
value of zero, the 25 cases examined could be expected to yield only one such 


result per variable. 


Regressions Using Average Income 


Appendix Table C-3 exhibits regression results obtained by regressing aver- 
age income on the independent variables employed here. In this case, of course, 
there is only one equation per year. The R? values here are .46 and .64 for 1950 


and 1960, respectively. This contrasts with an explained variance of around .99 


=«95< 
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Constant term and 
independent variables 


Constant term 
Education 

Road Class 1 
Tenure (managers) 
Age level 
Trading center 
Irrigated farms 
Farm size 

Real estate 
Urban index 
Isard index 


Constant term 
Education 

Road Class 1 
Tenure (managers) 
Age level 
Trading center 
Irrigated farms 
Farm size 

Real estate 
Urban index 
Isard index 


See supra, Table 1, 


APPENDIX TABLE C-1 


Calculated t-Ratios, 1950 


4 5 


Income class intervala, 
ee ee ee, Se ee ee | 


coefficient estimates 


page 6, for income 


values included in each 


class interval. 


0. 
0. 
a 
1. 
OF 
-0. 
0. 
2% 
1. 
Ls 
Ls 
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Constant term and 





independent variables 


Constant term 
Education 

Road Class 1 
Tenure (managers) 
Age level 


Trading center 


Irrigated farms 


Farm size 


Real estate 


Urban index 


Isard index 


APPENDIX TABLE C-2 


Calculated t-Ratios, 1960 


rer Ce ee es eee Ne ee ee ee 


coefficient estimates 


(Continued on next 
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Constant term and 


independent variables 


Constant term 


Education 


Road Class 1 


Tenure (managers) 


Age level 
Trading center 
Irrigated farms 
Farm size 
Real estate 
Urban index 


Isard index 


a/ See supra, Table 


APPENDIX TABLE C-2--continued. 


Income class intervala 


io Rap ines ahr enna Sesntnnimemneenlneentes 


72 2 ee eee 
-0.917 -0.120 0.199 0.670 1.315 
-0.977 -0.359 0.984 3.030 4.962 
-0.258 1.418 -0.337 -2.523 -1.426 

0.461 1.580 -0. 764 0.571 -0.199 
1.313 | 0.648 0.138 -0.822 -2.326 
1.748 | 1.352 0.524 -1.275 -2.135 
0.436 | 0.175 -1.072 0.068 -0.815 
-0.074 | 2.716 -1.031 -0.243 -3.101 
-0.925 | -2.540 0.930 -0.021 4.523 
1.201 | -0.728 -0.199 0.824 0.899 
-0.948 | 0.422 #277 -2.140 1.439 


page 6, for income values 


included 


in each class interval. 








APPENDIX TABLE C-3 


Results Obtained in the Regression of Average Income 
1950 and 1960 


Coefficients 
independent variables | 1950a/ 1960b 1950 1960 
Constant term 


Education 


Road Class 1 


Tenure (managers) 


Age level 
Trading center 
Irrigated farms 
Farm size 

Real estate 
Urban index 


Isard index 





al Rigs = -4628. 


b/ Ri 960 = -6391- 
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of total variance in the multivariate case as shown in Table 15. In the uni- 
variate case, t values significant at the 5 percent level are obtained for 
education, farm size, and real estate in both 1950 and 1960 and for trading 
center in 1960. However, the farm size coefficient changes sign between those 
years. The urban index, which was an important explanatory variable in the 
1950 multivariate model (based on Tables 15 and 16) does not have a signifi- 
cant t value in either 1950 or 1960 for this univariate case. 

In general, of course, the univariate case yields much less information 


than does the multivariate case. 
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